Multilocus sequence analysis reveals genetic diversity in Staphylococcus aureus isolate of goat with mastitis persistent after treatment with enrofloxacin

Staphylococcus aureus is one of the main bacterial agents responsible for mastitis in ruminants and plays an important role in the persistence and chronicity of disease treated with antimicrobials. Using multilocus sequence typing, network approaches and the study of the population diversity of microorganisms, we analyzed S. aureus (ES-GPM) isolated from goats with persistent mastitis (GPM). Most ES-GPM strains were phylogenetically distinct from the others, and the strains could be divided into two lineages: one consisting mostly of ES-GPM and the other of varied strains. These two lineages were separated by 27 nucleotide polymorphisms. The 43 strains comprised 22 clonal complexes (CCs), of which the ES-GPM strains were present in CC133, CC5 and a new complex formed by sequence type 4966. Some alleles showed greater genetic diversity and polymorphism than others; the aroE and yqiL genes, for example, were more diverse than the glpF gene. In addition, the arcC and glpF alleles showed the greatest number of mutations in ES-GPM relative to non-ES-GPM sequences. This study therefore identified genetic polymorphisms characteristic of S. aureus isolated from the milk of goats diagnosed with persistent mastitis after failed treatment with the antibiotic enrofloxacin. It may help in the future to identify and discriminate this agent in cases of mastitis, so that the most appropriate antibiotic treatment can be chosen before persistent mastitis caused by the agent appears, reducing the chances of premature culling and animal suffering.


Results
A total of 43 isolates were analyzed in this study. Of these, 18 (ES-GPM) were isolated from the milk of animals diagnosed with GPM at Capril UFV and were named Minas Gerais-LDBAC (Table S1). A further 25 isolates from animals diagnosed with mastitis were selected from the PubMLST database (https://pubmlst.org/organisms/staphylococcus-aureus/); these are shown in supplementary Table S3. This database search returned 22 bovine mastitis isolates and three ovine isolates. The sequences obtained originated from seven different Brazilian states: Rio de Janeiro (n = 7), Pernambuco (n = 7), São Paulo (n = 7), Minas Gerais (n = 1), Rio Grande do Sul (n = 1), Paraná (n = 3), Santa Catarina (n = 3) and Minas Gerais-LDBAC (n = 18), the last referring to the ES-GPM sequences (Fig. 1a and b). The analyzed sequences represent all the sequences deposited at https://pubmlst.org/organisms/staphylococcus-aureus/ referring to animal mastitis in Brazil up to 09/28/2019. The MICs ranged from 0.125 to 16 μg/mL in the isolates collected prior to treatment, whereas in the isolates collected after treatment with enrofloxacin the MICs ranged from 0.19 to 16 μg/mL, with only isolates IDs 33784 and 33785 showing 16 μg/mL, as previously reported 22.
Phylogeography, STs and CCs. Figure 1 shows the phylogeographic distribution of the S. aureus isolates, relating the phylogeny of the strains to the isolation sites of the samples; there was little association between the origin of an isolate and the phylogenetic structure found. Despite the apparently small number of sequences shown in Fig. 1, these MLST data represent the totality of Brazilian sequences deposited for S. aureus that infected dairy animals. However, strains related to ovine and caprine mastitis apparently had isolates from bovine mastitis as common ancestors.
In addition, ES-GPM-related strains were highly associated in clades or clusters, which may be because few different strains causing GPM circulate among animals within the same goat herd.
On the other hand, strains related to bovine mastitis branched widely across the phylogenetic tree (Fig. 1), regardless of the Brazilian region studied, which implies greater nucleotide polymorphism. Strains related to ES-GPM are apparently only distantly related phylogenetically to those of the other two host species (ovine and bovine), which show no clear phylogenetic distinction from each other, as illustrated by bovine strain ID 1555 and ovine strain ID 3446 (Fig. 1). However, even though most of the ES-GPM-related strains remained phylogenetically close, strain ID 33770 (ST 5; Fig. 1) was distant from the other ES-GPM strains and close to those from ovines and bovines. In addition, strains IDs 33784 and 33770 were collected prior to treatment with enrofloxacin, as listed in Table S1, which possibly explains why they were phylogenetically more distant from the other ES-GPM strains and were assigned a different ST.
The corresponding phylogenetic analyses (Fig. 2a,b) were highly consistent with the multilocus sequence tree (MS tree) (Fig. 2c), and considering the seven (AT) genes characterized for S. aureus (Fig. 2a,b) provided better resolution than a single housekeeping gene would. These analyses also presented a robust topology with high bootstrap values, dividing the studied population into two lineages (Fig. 2a). The phylogenetic tree based on the concatenated housekeeping genes, shown in Fig. 2b, indicated moderate differentiation among populations, as also shown in Fig. 1b. Nevertheless, the 43 strains were divided into two clusters, called Lineage 1 (L1) and Lineage 2 (L2), comprising four CCs and the other 18 CCs, respectively.
Accordingly, each lineage cluster (Fig. 2a) represents one or more phylogenetic associations that also determined the formation of CCs in Fig. 2c; thus, 20 STs with their concatenated sequences formed L1, and 23 formed L2. The high nucleotide identity caused L1 to form a smaller cluster of CCs than L2 (Fig. 2).
Fig. 1 (partial caption). (b) Phylogenetic tree structured with metadata in Microreact, with the colors referenced to each Brazilian state analyzed. The tip labels give the source of the samples, the identification of the isolates (ID) and the year in which each sample was collected and analyzed. The tree was visualized using FigTree software, and the fully interactive version of the phylogenetic tree, including geospatial information, can be found at https://microreact.org/project/puA-MDz8o. Created with Microreact version 5.93.0 (http://microreact.org).
Interestingly, the distribution patterns of CCs found in the MS tree (Fig. 3) were similar to those of the haplotype networks (Figs. 4 and 5). Thus, H20, H22, H5 and H9 (Fig. 4) corresponded to CC133, CC4966, CC745 and CC1729, respectively, while H21, H8, H10 and H4 corresponded to CC5, CC1730, CC1728 and CC744. In addition, CC133 (Fig. 3) or H20 (Fig. 4) had between 24 and 27 mutations separating it from the other haplotypes, and the groups corresponding to CC5 (Fig. 3) or H21 (Fig. 4) were separated from the rest of the network groups by 6 to 7 mutations. However, most haplotypes (72.73%) were separated by one mutation, while the rest (23.81%) were separated by between 2 and 27 mutations.
The high number of mutations separating H22 established the division of the groups, and Lineage 1 (L1; 22.73%) was formed by haplotypes H5, H9, H20 and H22. H22 was thus positioned exactly where the two clusters/lineages separate (Figs. 4 and 5), between H22 and H19, a result consistent with the analysis in Fig. 2b. On the other hand, the H21, H8, H10 and H4 haplotypes were part of Lineage 2 (L2), with H21 (CC5) being the only GPM and ES-GPM haplotype that did not group with L1.
Of the 22 haplotypes found, only 13% were from goats, and H20 was the most common (n = 16, 37.2%), formed by ES-GPM collected at UFV, Minas Gerais. The H20 haplotype (Minas Gerais-LDBAC) from goat strains proved to be a common ancestor of the H22 (Minas Gerais-LDBAC), H9 (Pernambuco) and H5 (Rio de Janeiro) haplotypes (Fig. 5). The H19 haplotype (Santa Catarina) differed most in the number of mutations from its ancestors and from the other haplotypes; a greater number of samples from this state could help to reduce this isolation and complement the network.
Among the other haplotypes belonging to lineage L1, H20 and H5, with bovine and ovine sequences, respectively, showed only one point mutation between them. The H19 haplotype, also belonging to this lineage, was the most distinct from the others. Lineage L2 (77.27%) was formed by the other haplotypes, with cattle representing 82.35% of this lineage (Fig. 4). The number of mutations represented in the haplotype networks of Figs. 4 and 5 suggests that lineage L1 can be highly divergent from the isolates of lineage L2.
The H6 haplotype, comprising bovine S. aureus sequences, was the haplotype with the greatest dispersion among the states, occurring in the southern region (Santa Catarina and Rio Grande do Sul) and in the southeast region (Rio de Janeiro). In addition, H16 was a common ancestor and dividing point of the haplotype network, from which several haplotypes branched, such as H13 and H12 (Pernambuco) and H1 (Table 1). While the haplotype network analysis of the concatenated housekeeping genes (large population) revealed greater specificity of the haplotypes for their host, the separate analysis of the seven housekeeping genes showed predominantly an aggregation of the isolates in a non-host-specific manner (Figs. S1 and S2). The haplotypes from bovine milk samples appear to be more specific, whereas the haplotypes from ovines and goats showed greater plurality. The frequency with which different hosts shared an AT of S. aureus in the same haplotype was low: 9% on average. Only the H4 haplotype for the yqiL gene showed 100%, grouping three different hosts in the same proportion. The yqiL and aroE genes showed the highest haplotype diversity values, 0.7907 and 0.7375, respectively, while glpF had the lowest, 0.5548 (Table 1). The greater genetic variability observed for the yqiL and aroE genes, as reflected in the haplotype diversity values, suggests that these genes are more mutable than glpF. The genetic diversity of the arcC, tpi, pta and gmk genes was very similar (Table 1). The arcC and tpi genes showed a remarkably similar distribution of haplotypes in each host. However, the pta and gmk genes showed more host-specific haplotypes for ovines and goats than the other genes (supplementary Figs. S1 and S2), and the pta gene (Fig. S2) in particular appears to have greater haplotype specificity for goats and ovines than the other genes.
Sequence diversity. To assess the general sequence diversity of the seven MLST loci of the isolates under study, the average GC content (genomic DNA base composition), the number of polymorphic sites, the ratio of non-synonymous (dN) to synonymous (dS) substitutions and the nucleotide diversity (π) were calculated and are shown in Table 2. Sequence alignment of each MLST locus showed no insertions or deletions. The concatenated sequences for the seven loci were 3186 bp in length, with a diversity index π of 0.00670. The mean GC content of the MLST gene fragments ranged from 30.10% for aroE to 40.9% for glpF. The nucleotide diversity index π ranged from 0.00346 (glpF) to 0.01119 (aroE; Table 2). The proportion of polymorphic sites varied from 1.07% for glpF to 2.93% for aroE, the most polymorphic locus (Table 2). Interestingly, the glpF sequence showed the lowest haplotype diversity of all (Table 1) and the highest average G + C content (Table 2). In general, the greatest number of mutations was observed for the whole population of sequences (Table 2), particularly for the aroE and yqiL alleles. The ES-GPM sequences (supplementary Table S4) likewise showed that the aroE and yqiL alleles presented more polymorphisms. Moreover, the arcC (66%) and glpF (60%) alleles showed the greatest number of mutations in ES-GPM relative to non-ES-GPM sequences (Table S4).
The phi test is a rapid, statistically efficient test for recombination. The P-value generated by the phi test was p = 4.053E−5 for all 22 STs (Table 3) and p = 5.413E−03 for L2, which indicates significant evidence of recombination across the whole population and within L2. For L1, however, the program could not compute a value (NA), which suggests either that there was no significant evidence of recombination for L1 or that there were too few informative sites. Furthermore, the phi test for each AT did not give a significant result.
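The summary statistics described above can be illustrated with a minimal Python sketch. It assumes a gap-free alignment of equal-length sequences (consistent with the absence of insertions and deletions reported here) and uses a toy alignment rather than the study data. The function names are hypothetical, and π is computed as the average number of pairwise differences per site, which is one common definition and not necessarily the exact estimator implemented in the software used in this study.

```python
from itertools import combinations


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a single sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def polymorphic_sites(alignment: list[str]) -> int:
    """Number of alignment columns containing more than one distinct base."""
    return sum(1 for column in zip(*alignment) if len(set(column)) > 1)


def nucleotide_diversity(alignment: list[str]) -> float:
    """Average number of pairwise differences per site (pi)."""
    n = len(alignment)
    length = len(alignment[0])
    total_differences = sum(
        sum(a != b for a, b in zip(s1, s2))
        for s1, s2 in combinations(alignment, 2)
    )
    n_pairs = n * (n - 1) / 2
    return total_differences / n_pairs / length


# Toy alignment (hypothetical data, for illustration only).
toy_alignment = ["ATGCATGC", "ATGCATGA", "ATGAATGC", "ATGCATGC"]
print(gc_content(toy_alignment[0]))
print(polymorphic_sites(toy_alignment))
print(nucleotide_diversity(toy_alignment))
```

Dividing the number of polymorphic sites by the alignment length gives the percentage form used in the text (for example, 2 of 8 sites in the toy alignment).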

The per-site ρ/θ (rho/theta) value detected for the 22 STs was 2.16E−01 (Table 3), suggesting that point mutation was 4.63-fold more likely to occur than recombination at the level of the whole population. The ρ/θ ratio was 6.24E−01 for lineage L2, which likewise suggests that point mutation in this lineage is 1.62-fold more likely to occur than recombination. The ρ/θ ratio for L1 could indicate a high recombination rate, although the phi test provided no evidence of this, as reported above. The IAs values were 0.0052 (P < 1.00E−04) and 0.0014 (P < 1.00E−04; Table 3) for all 22 STs and L2, respectively, indicating a tendency toward linkage disequilibrium between the alleles of L2 and at the level of the whole population (all 22 STs). This result indicates that clonal relationship and recombination were not sufficient to break down the linkage disequilibrium for all 22 STs and L2. For L1, however, a tendency toward free recombination between the alleles in the lineage was suggested.
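The fold values quoted above follow from inverting the per-site ρ/θ ratio. The short Python sketch below illustrates the arithmetic for the whole-population value, reading the reported ratio as 2.16E−01, an assumption that is consistent with the 4.63-fold interpretation given in the text. The function name is hypothetical.

```python
def mutation_fold_over_recombination(rho_over_theta: float) -> float:
    """How many times more likely point mutation is than recombination,
    given a per-site rho/theta ratio (recombination rate / mutation rate)."""
    if rho_over_theta <= 0:
        raise ValueError("rho/theta must be positive")
    return 1.0 / rho_over_theta


# Whole-population ratio, assumed to be 0.216 (2.16E-01).
print(round(mutation_fold_over_recombination(0.216), 2))
```

A ratio below 1 therefore means that mutation dominates, and a ratio above 1 means that recombination dominates.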

Discussion
In recent years, persistent infections have been shown to play an important role in the relapse and recalcitrance of infections; moreover, they are likely to help spread antibiotic resistance 23, and persistence is a potentially critical trigger for therapeutic failure 4. Overall, the phylogenetic analyses established two lineages among cases of bovine, ovine and caprine mastitis in different states of Brazil, and showed highly clonal ES-GPM, unresponsive to the antibiotic enrofloxacin, in a single herd of goats. The 43 isolates that formed the 22 CCs provided further evidence that geographic isolation was not the primary factor leading to the moderate genetic differentiation of S. aureus ES-GPM and non-ES-GPM. S. aureus isolates from animals are commonly assigned to host-specific CCs 24, and CC133 was the main group of characterized isolates belonging to ES-GPM in this study, in addition to the presence of the CC5 complex. In previous studies, CC133 has been associated with small ruminants 25 and has been specifically assigned to goat, ovine and bovine isolates in several different countries 24 and in Brazil 26. According to Aires-de-Sousa et al. 27, the CC133 of S. aureus, of human ancestry, may have adapted to small ruminants through adaptive diversification of the genome resulting from allelic variation, the loss of genes or the horizontal acquisition of mobile genetic elements.
CC5 was also identified among the ES-GPM strains, but in a separate lineage from CC133. CC5 is recognized as a common clonal complex found in several hosts and is among the most prevalent clones that cause hospital infections in humans, including methicillin-resistant S. aureus (MRSA) 28. CC5 has also been characterized in bovine mastitis 29 and in buffalo milk 30, and has been isolated from foods such as milk and dairy products 31. Consequently, the persistent S. aureus lineage may be silently acting in persistent infections, along with CC133 and CC5, in other cases of mastitis or even in other clonal complexes not studied here.
Rabello et al. 32 suggest that the prevalence of a limited number of clones is strictly related to mastitis in different herds, and Hoekstra et al. 33 demonstrated that the same genotype of S. aureus can cause clinical and subclinical mastitis in goats and ovines. However, diversity analyses implemented using different techniques indicate that sequences differ by type of mastitis, as in ES-GPM. This indicates that agent selection or challenge 3 can trigger the persistence 34 of mastitis; therefore, the history of the antibiotic classes chosen for treatment and the adaptations of the agent must be taken into account in MLST analyses of mastitis, as in cases of MRSA or MSSA (methicillin-susceptible S. aureus). As the mutations that lead bacteria to persist against enrofloxacin and other fluoroquinolones are still being determined 12,35-37, more complete studies, with whole-genome sequencing 38 and computational biology analyses, should be carried out in the future to elucidate these issues.
As reported above, we did not find high genetic variability in this study; likewise, it has been reported that cases of animal mastitis caused by S. aureus in small regions of Brazil are related to low genetic variability and a small clonal population 27. A greater number of sequences could better demonstrate the genetic diversity, patterns of distribution and evolutionary history of this agent in dairy animals. On the other hand, in a study with a large number of isolates collected from different regions of Norway, the strains were closely related genetically, and their clonal population was responsible for most cases of mastitis caused by S. aureus in domestic ruminants 39. The low diversity of S. aureus in milk samples in studies with ruminants 24,27,40 may be related to its later diversification from S. aureus associated with humans through a combination of foreign DNA acquisition and gene decay 41, and to the fact that these strains, in general, are young in relation to the species 38. However, in this study the strains related to bovine mastitis demonstrated greater nucleotide polymorphism, possibly associated with evolutionary pressures on the pathogen from factors related to adaptation to optimize the process of infection and escape the host immune response, as well as possible adaptation to different environmental niches and the use of antibiotics, which affects the evolution of certain core genes 42.
In general, for isolates of S. aureus from invasive disease the r/m per allele parameter is approximately 1:1 43 , which means that the isolates would have the same probability of diversifying in their large population by recombination as by point mutation. However, in humans, the genetic variability of S. aureus may be mainly associated with point mutation, since alleles are up to 15 times more likely to change by point mutation than by recombination 44 , similar to the results of our study. Conversely, in S. aureus adapted to cattle, it was found by MLST alleles that a nucleotide substitution was more likely to be due to recombination than to point mutation, and equally likely in humans 45 . Nevertheless, there is an extensive difference between the S. aureus genomes associated with cattle and those isolated from humans 41 . These differences between genomes and their r/m ratios demonstrate that there are gaps in the understanding of the diversification behavior of S. aureus between different hosts. In addition, subtle changes in strain due to single non-synonymous point mutation in S. aureus may be involved with persistence to antibiotics 46 as diagnosed in GPMs unresponsive to treatment with enrofloxacin.
Nevertheless, there is evidence of widespread homologous recombination in the core genome of S. aureus in studies of animal-associated strains from ovines, bovines and poultry 38, due to the action of mobile genetic elements (MGEs), which generate a landscape of hotspots in the core genome 38. Furthermore, there are concerns about the exchange of S. aureus CCs between animals and humans 47,48, since strains of this agent have been described in the literature as capable of causing disease in both animals and humans 49. Therefore, the probability of widespread homologous recombination between S. aureus from different hosts may be high, and the potential of this agent to infect both humans and animals may mean that the relative degree of recombination or point mutation has no clear pattern in S. aureus, unless characteristics other than the infection chain, such as those of ES-GPM, are defined in the analysis metadata.
The greater diversity and polymorphism of some alleles of S. aureus may be associated with adaptive mutations in response to environmental changes or a switch in host species, since this agent presents tropism to several hosts, and in particular to antibiotic exposure 50. In addition, housekeeping genes are associated with metabolic maintenance, which suggests that adaptations in these genes may be affecting metabolism in response to distinct nutrient availability 50. Of all the genes, yqiL and aroE showed the highest values of non-synonymous substitutions and mutations. The yqiL locus in S. aureus demonstrated a potential signature of recombination 45 compared to the other six gene fragments 38, but the role of this locus in infections is poorly understood. However, a high number of non-synonymous substitutions may suggest that the removal of deleterious mutations by purifying selection is relatively slow. The aroE gene has also been shown to contribute to the chronicity of S. aureus infection, including the invasiveness and cytotoxicity of the agent, with an increase in the load of intracellular bacteria 51; this locus may therefore play an important role in the persistence of infections such as ES-GPM in goats. Moreover, the aroE locus lies in a region of the genome with an excess of homologous recombination, most likely in MGEs 38, and may thereby confer an increased capacity to colonize and infect ruminants 52. Interestingly, the glpF sequence was the most conserved of all, with the lowest haplotype diversity, the highest average G + C content and low numbers of mutations, even though it lies in a region of the genome with excess homologous recombination 38.
Scientific Reports | (2021) 11:17252 | https://doi.org/10.1038/s41598-021-96764-z
The highest average G + C content may be linked to the pathogenic potential for the host genome 53. However, we can emphasize that the glpF gene has a role in the formation of the L form in bacteria 54, which is directly involved in antibiotic tolerance or persistence 55 in S. aureus, for example to ampicillin or norfloxacin 3. This shows that the glpF sequence, even though it is one of the metabolic housekeeping genes and is present in the core genome, is directly linked to persistence in infections; a high glpF sequence polymorphism may therefore not be advantageous for colonization of the host, as in ES-GPM under enrofloxacin, which would explain why it is more conserved than the other six alleles.

Conclusion
In this study we identified 27 specific genetic mutations for strains of ES-GPM (S. aureus isolated from goats with persistent mastitis) that may help in the future to discriminate S. aureus in cases of persistent goat mastitis (GPM). In addition, among the 22 CCs that we found, CC133, CC5 and a new ST 4966 were specifically related to ES-GPM. We describe polymorphisms in specific alleles of the arcC and glpF genes, which showed the greatest number of mutations in ES-GPM relative to non-ES-GPM. Furthermore, the identification of S. aureus and of these genetic polymorphisms in persistent bacterial infections, together with MLST, can assist in choosing an appropriate protocol and antibiotic for the treatment of persistent mastitis in goats. We hope that future studies can better clarify the persistence of this agent under certain antibiotic treatments.

Methods
Bacterial strains. In this study, we used two data sources. The first was cataloged by our research group and comprised 18 isolates of S. aureus recovered from milk samples of goats diagnosed with GPM and treated with the antibiotic enrofloxacin. The animals are kept under intensive farming in a free stall regime, with a high-level mechanical milking system and automatic cleaning of milk pipes. Samples were collected at Capril UFV, located in the mesoregion of Zona da Mata of Minas Gerais, Brazil. The isolates were identified by phenotypic and genotypic tests, as well as by bacteriological examination and antibiotic sensitivity tests, as previously reported 12,56.
Briefly, the animals selected were examined and diagnosed with clinical mastitis caused by S. aureus. First, they were evaluated for signs of clinical mastitis and the presence of at least visually abnormal milk (i.e., the presence of flakes, clots, blood, or serous milk), as well as changes in the mammary gland, such as increased volume and temperature, and the presence of pain and redness during forestripping performed at the milking parlour, in the presence of a veterinarian. After antibiogram results, these animals were treated with enrofloxacin (KINETOMAX, Bayer). Minimal inhibitory concentrations (MICs) for enrofloxacin were determined using the Etest method (BioMérieux, Marcy l'Étoile, France). However, due to acute mastitis and the need for rapid treatment, these MIC results were available only after completion of seven days of enrofloxacin treatment. Twenty-one days after the completion of treatment, these animals continued to have clinical mastitis. New milk samples were collected, and S. aureus was isolated again. Therefore, the diagnosis of persistent mastitis (GPM) was based on the detection of the same species of agent in more than one consecutive sampling 57-59. Thus, 18 isolates of S. aureus were obtained (nine before treatment and nine after treatment) and were grouped in this study as ES-GPM.
The second source of data for the analyses was another 25 sequences of S. aureus isolates recovered from milk samples, selected from ovine and bovine mastitis cases in the MLST online database (https://pubmlst.org/organisms/staphylococcus-aureus/). The search terms used were: country (Brazil), source (Milk) and disease (Mastitis), giving a total of 43 isolates from seven Brazilian states (Table S3).
Genomic DNA extraction, MLST locus amplification and sequencing. The seven housekeeping genes of the S. aureus isolates obtained from the GPMs were sequenced: carbamate kinase (arcC), shikimate dehydrogenase (aroE), glycerol kinase (glpF), guanylate kinase (gmk), phosphate acetyltransferase (pta), triosephosphate isomerase (tpi) and acetyl coenzyme A acetyltransferase (yqiL; Table S2), as previously described and evaluated at https://pubmlst.org/organisms/staphylococcus-aureus/ 60. DNA from the S. aureus isolates was extracted using the PROMEGA kit following the manufacturer's protocol, and fragments were amplified according to the protocol described by Enright et al. 61. The gene and primer specifications are shown in Table S2. MLST locus amplification was performed in 50 µL reaction volumes containing 0.5 µL DNA, 0.5 µg of each primer, 1 U Taq DNA polymerase (Qiagen, Crawley, UK), 5 µL 10× buffer (supplied with the Taq polymerase) and 0.2 mM deoxynucleoside triphosphates (Perkin-Elmer Applied Biosystems; Foster City, California). Initial denaturation was for 5 min at 95 °C, followed by 30 cycles of annealing at 55 °C for 1 min, extension at 72 °C for 1 min and denaturation at 95 °C for 1 min, followed by a final extension at 72 °C for 5 min. The amplified products were sent for sequencing by capillary electrophoresis at MACROGEN, INC. (Seoul, South Korea).
Alignment, editing and curation of S. aureus MLST sequences. The 65 and phylogenetic trees were calculated in two runs with 1,000,000 (one million) generations and a sampling frequency of 100 (one hundred). Parameter convergence was analyzed in Tracer version 1.6 (http://tree.bio.ed.ac.uk/software/tracer), and 25% of the trees generated were discarded as burn-in to produce the consensus tree. The phylogenetic tree and geospatial information were visualized together with associated metadata using Microreact Web server version 5.93.0 66.
Population structure and recombination analyses. Strain relationships were analyzed using the goeBURST algorithm 70, as implemented in the software PHYLOViZ 71, to cluster the STs into CCs based on the most stringent definition. Global optimal eBURST implemented by PHYLOViZ was used to cluster STs, generating a multilocus sequence tree (MS tree) to visualize possible evolutionary relationships between STs. The pairwise homoplasy index (phi) test 72 for recombination, implemented in SplitsTree4 73, was performed, and a P-value of < 0.05 was taken to indicate recombination. The LDhat program 74 implemented in Recombination Detection Program (RDP) v.4.97 75 was used to calculate the per-site ρ/θ ratio based on the concatenated sequences of the seven loci with 1,000,000 MCMC updates. The parameters ρ and θ represent the rates of recombination and mutation, respectively. Linkage disequilibrium from allelic data was evaluated by calculating the standardized index of association (IAs) using LIAN v3.735 76 through its web interface (http://guanine.evolbio.mpg.de/cgi-bin/lian/lian.cgi.pl/query). The null hypothesis of linkage equilibrium (IAs = 0) was tested against the alternative (IAs > 0; presence of linkage disequilibrium or clonality) using Monte Carlo methods with 10,000 iterations on the allelic profiles 17. If there is linkage equilibrium because of frequent recombination events, the expected value of IAs is zero, which suggests no association between alleles at different loci; if IAs differs significantly from zero, the alleles are considered to be genetically linked 77.
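The logic of the standardized index of association and its Monte Carlo test can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the LIAN implementation: it takes IAs = (V_D / V_e − 1) / (l − 1), where V_D is the variance of pairwise allelic distances, V_e = Σ h_j (1 − h_j) is its expectation under linkage equilibrium, h_j is the gene diversity at locus j and l is the number of loci. The function names and the toy allelic profiles are hypothetical.

```python
import random
from collections import Counter
from itertools import combinations


def gene_diversity(alleles):
    """Unbiased gene diversity at one locus: n/(n-1) * (1 - sum(p_i^2))."""
    n = len(alleles)
    freqs = [count / n for count in Counter(alleles).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))


def standardized_ia(profiles):
    """Standardized index of association for a list of allelic profiles."""
    n_loci = len(profiles[0])
    # Pairwise distance = number of loci at which two profiles differ.
    distances = [
        sum(a != b for a, b in zip(p, q)) for p, q in combinations(profiles, 2)
    ]
    mean_d = sum(distances) / len(distances)
    v_observed = sum((d - mean_d) ** 2 for d in distances) / len(distances)
    diversities = [gene_diversity([p[j] for p in profiles]) for j in range(n_loci)]
    v_expected = sum(h * (1 - h) for h in diversities)
    return (v_observed / v_expected - 1) / (n_loci - 1)


def monte_carlo_p(profiles, iterations=1000, seed=0):
    """One-sided P-value: share of shuffled data sets with IAs >= observed.

    Alleles are shuffled independently at each locus, which breaks any
    association between loci while keeping allele frequencies unchanged.
    """
    rng = random.Random(seed)
    observed = standardized_ia(profiles)
    columns = [list(column) for column in zip(*profiles)]
    hits = 0
    for _ in range(iterations):
        shuffled = [rng.sample(column, len(column)) for column in columns]
        if standardized_ia(list(zip(*shuffled))) >= observed:
            hits += 1
    return hits / iterations


# Toy, fully clonal data set (hypothetical): two profiles, three isolates each.
toy_profiles = [(1, 1, 1)] * 3 + [(2, 2, 2)] * 3
print(standardized_ia(toy_profiles))
print(monte_carlo_p(toy_profiles))
```

In the toy data the alleles at all three loci are perfectly associated, so the index is at its maximum of about 1; values near zero would instead be expected under frequent recombination.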
Genetic network. Genetic networks present an alternative view of the genealogies represented by the bifurcating structures of phylogenetic trees, and the possible dispersal routes of S. aureus isolates in Brazilian dairy milk were predicted following the methodology described by Vidigal et al. 78. To reconstruct the network, sequences of AT and ST genes were grouped into haplotypes using DnaSP v6 68. The network was then constructed using Network 4.6.1.0 (http://www.fluxus-technology.com) and the Median Joining (MJ) algorithm 79.
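The grouping of sequences into haplotypes can be illustrated with a short Python sketch. It is a simplified stand-in for the DnaSP step, assuming that identical aligned sequences define one haplotype and that haplotype diversity is computed as Hd = n/(n − 1) × (1 − Σ p_i²); the isolate names, sequences and function names are hypothetical.

```python
from collections import Counter


def group_haplotypes(sequences):
    """Map each distinct sequence to the list of isolate IDs that carry it."""
    groups = {}
    for isolate_id, seq in sequences.items():
        groups.setdefault(seq, []).append(isolate_id)
    return groups


def haplotype_diversity(sequences):
    """Haplotype diversity: n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(sequences)
    counts = Counter(sequences.values())
    return n / (n - 1) * (1 - sum((c / n) ** 2 for c in counts.values()))


# Toy data (hypothetical isolates and sequences, for illustration only).
toy_sequences = {
    "iso1": "ATGC",
    "iso2": "ATGC",
    "iso3": "ATGA",
    "iso4": "TTGC",
}
for number, (seq, isolates) in enumerate(group_haplotypes(toy_sequences).items(), 1):
    print(f"H{number}: {seq} shared by {isolates}")
print(haplotype_diversity(toy_sequences))
```

The resulting haplotype groups, together with the number of differences between them, are the kind of input from which a median-joining network is built.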
Ethics statement. The experimental protocol was approved by the Ethics Committee (Comissão de ética no uso de animais, CEUA) of the Federal University of Viçosa, under protocol number 43/2016. The methods were carried out in accordance with the approved guidelines. This experiment was conducted by the Bacterial Diseases Laboratory (LDBAC) and the Molecular Biology Laboratory (BIOMOL), located at the Veterinary Department (DVT) of the Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais.

Availability of data and materials
All data generated and/or analyzed during this study are included in this published article, and at Microreact interactive viewer 66 [https://microreact.org/project/puA-MDz8o]. Additionally, the accession numbers of the sequences or reference codes used in this study are called IDs and can be seen in the supplementary tables S1 and S3, with their isolated characteristics and their corresponding AT and ST. These data are publicly available and accessible online at the S. aureus PubMLST database [https://pubmlst.org/organisms/staphylococcus-aureus] (IDs: 33768 - 33785) 21.